Name | Version | Summary | Date |
pyvisionai | 0.3.1 | A Python library for extracting and describing content from documents using Vision LLMs | 2025-02-22 22:21:47 |
aspose-cells-gridjs-net-python | 25.2.0 | A lightweight, scalable, and customizable toolkit that provides cross-platform web applications, enables convenient development for editing or viewing Excel/Spreadsheet files, offers simple deployment, and provides easy-to-use APIs. | 2025-02-18 09:39:00 |
aspose-cells-python | 25.2.0 | A powerful library for manipulating and converting Excel (XLS, XLSX, XLSB), CSV, ODS, PDF, JSON, JPG, PNG, BMP, EMF, SVG and HTML files. | 2025-02-17 02:08:15 |
tinbox | 0.1.0 | A CLI translation tool using LLMs for document translation | 2025-02-16 19:16:56 |
wdoc | 2.5.7 | A perfect AI powered RAG for document query and summary. Supports ~all LLM and ~all filetypes (url, pdf, epub, youtube (incl playlist), audio, anki, md, docx, pptx, or any combination!) | 2025-02-13 23:04:06 |
docling | 2.21.0 | SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications. | 2025-02-10 11:44:35 |
subtitles2text | 0.0.3 | Subtitles (VTT, SRT, PDF, DOCX, HTML, images, etc) to text convertor, with a GUI, great for preprocessing to feed to LLMs | 2025-02-07 19:06:37 |
aspose-total-net | 25.1.0 | Aspose.Total for Python via .NET is a Document Processing python class library that allows developers to work with Microsoft Word®, Microsoft PowerPoint®, Microsoft Outlook®, OpenOffice®, & 3D file formats without needing Office Automation. | 2025-02-03 18:11:00 |
docling-google-ocr | 2.13.1 | SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications. | 2025-02-02 06:56:31 |
aspose-html-net | 25.1.0 | Aspose.HTML for Python via .NET is a powerful API for Python that provides a headless browser functionality, allowing you to work with HTML documents in a variety of ways. With this API, you can easily create new HTML documents or open existing ones from different sources. Once you have the document, you can perform various manipulation operations, such as removing and replacing HTML nodes. | 2025-01-31 17:07:47 |
tikara | 0.1.5 | The metadata and text content extractor for almost every file type. | 2025-01-26 23:33:40 |
asposepdfcloud | 25.1.0 | Aspose.PDF Cloud | 2025-01-23 13:31:06 |
aspose-words-cloud | 25.1.0 | Python Cloud SDK wraps Aspose.Words Cloud API so you could seamlessly integrate Microsoft Word file generation, manipulation, conversion & inspection features into your own python applications. | 2025-01-16 13:54:13 |
aspose-cells | 25.1.0 | A powerful library for manipulating and converting Excel (XLS, XLSX, XLSB), ODS, CSV and HTML files. | 2025-01-15 13:59:03 |
docowling | 1.0.17 | SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications. | 2025-01-11 17:29:23 |
leadtools | 23.0.0.4 | Powered by patented artificial intelligence and machine learning algorithms, LEADTOOLS is a collection of comprehensive toolkits to integrate recognition, document, medical, imaging, and multimedia technologies into desktop, server, tablet, web and mobile solutions. | 2025-01-02 19:30:22 |
groupdocs-total-net | 24.12.0 | GroupDocs.Total for Python via .NET is an all-in-one suite that provides powerful APIs for document comparison, viewing, and watermarking. This package is designed to enhance your document management capabilities with ease and efficiency, catering to a wide range of file formats and functionalities. | 2024-12-31 19:15:16 |
extended-docling | 2.12.1 | SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications, now with Google OCR support. | 2024-12-30 09:41:40 |
groupdocs-conversion-net | 24.12 | File converter for the most commonly used formats, including DOCX, PDF, CAD, and many more. | 2024-12-26 19:42:38 |
unstructured-expanded | 0.16.11.post2 | Expansion to the unstructured package, adding support for image extraction. | 2024-12-21 22:43:01 |